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(iii) NUMBER OF SEQUENCES: 11 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE : Limbach & Limbach 

(B) STREET: 2001 Ferry Building 

(C) CITY: San Francisco 

(D) STATE : CAL 
(F) ZIP: 94111 



(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC - DOS /MS - DOS 

(D) SOFTWARE: Patent in Release #1.0, Version #1.25 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: US 923,692 

(B) FILING DATE: 31 -JUL- 1992 

(C) CLASSIFICATION: 
(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 600,244 

(B) FILING DATE: 22 -OCT- 1990 



(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 641,617 

(B) FILING DATE: 16-JAN-1991 



(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 310,881 

(B) FILING DATE: 17-FEB-1989 



(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 160,766 

(B) FILING DATE: 26-FEB-1988 



(vii) 



PRIOR APPLICATION DATA: 



PATENT 



3- 



(A) APPLICATION NUMBER: US 160,771 
<B) FILING DATE: 26 -FEB- 19 88 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 347,637 

(B) FILING DATE: 05-MAY-1989 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 363,138 

(B) FILING DATE: 08-JUN-19 89 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 219,279 

(B) FILING DATE: 15 -JUL- 1988 

(viii) ATTORNEY /AGENT INFORMATION: 

(A) NAME: Halluin, Albert P. 

(B) REGISTRATION NUMBER: 2 8,957 

(C) REFERENCE/DOCKET NUMBER: BIOG-20121 USA 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 415-433-4150 

(B) TELEFAX: 415-433-8716 



(2) INFORMATION FOR SEQ ID NO: 1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(iii) 



HYPOTHETICAL : NO 



PATENT 

-4- 

(iv) ANTI- SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 

Pro Xaa Gly Pro 
l 

(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI -SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2: 

GGGTACCTGG GCC 13 



(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 886 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) 



MOLECULE TYPE: DNA (genomic) 
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(iii) HYPOTHETICAL: NO 

(iv) ANTI- SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Chinese cucumber 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: alpha- trichosanthin 

(ix) FEATURE: 

(A) NAME/KEY: CDS (B) LOCATION: 8. .877 

(B) LOCATION: 8. .877 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

CTCGAGG ATG ATC AGA TTC TTA GTC CTC TCT TTG CTA ATT CTC ACC CTC 49 

Met lie Arg Phe Leu Val Leu Ser Leu Leu lie Leu Thr Leu 
15 10 

TTC CTA ACA ACT CCT GCT GTG GAG GGC GAT GTT AGC TTC CGT TTA TCA 97 

Phe Leu Thr Thr Pro Ala Val Glu Gly Asp Val Ser Phe Arg Leu Ser 
15 20 25 30 

GGT GCA ACA AGC AGT TCC TAT GGA GTT TTC ATT TCA AAT CTG AGA AAA 145 

Gly Ala Thr Ser Ser Ser Tyr Gly Val Phe lie Ser Asn Leu Arg Lys 
35 40 45 

GCT CTT CCA AAT GAA AGG AAA CTG TAC GAT ATC CCT CTG TTA CGT TCC 193 

Ala Leu Pro Asn Glu Arg Lys Leu Tyr Asp lie Pro Leu Leu Arg Ser 
50 55 60 




TCT CTT CCA GGT TCT CAR 

Ser Leu Pro Gly Ser Gin 
65 

GCC GAT GAA ACC ATT TCA 

Ala Asp Glu Thr lie Ser 
80 

ATG GGA TAT CGC GCT GGC 

Met Gly Tyr Arg Ala Gly 
95 100 

GCA ACA GAA GCT GCA AAA 

Ala Thr Glu Ala Ala Lys 
115 

ACG CTT CCA TAT TCT GGC 

Thr Leu Pro Tyr Ser Gly 
130 

AAA ATA AGG GAA AAT ATT 

Lys lie Arg Glu Asn lie 
145 

ATT ACC ACT TTG TTT TAC 

lie Thr Thr Leu Phe Tyr 
160 
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CGC TAC GCA TTG ATC CAT 

Arg Tyr Ala Leu lie His 
70 

GTG GCC ATA GAC GTA ACG 

Val Ala lie Asp val Thr 
85 90 

GAT ACA TCC TAT TTT TTC 

Asp Thr Ser Tyr Phe Phe 
105 

TAT GTA TTC AAA GAC GCT 

Tyr Val Phe Lys Asp Ala 
120 

AAT TAC GAA AGG CTT CAA 

Asn Tyr Glu Arg Leu Gin 
135 

CCG CTT GGA CTC CCA GCT 

Pro Leu Gly Leu Pro Ala 
150 

TAC AAC GCC AAT TCT GCT 

Tyr Asn Ala Asn Ser Ala 
165 170 
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CTC ACA AAT TAC 241 

Leu Thr Asn Tyr 
75 

AAC GTC TAT ATT 289 
Asn Val Tyr lie 

AAC GAG GCT TCT 337 

Asn Glu Ala Ser 
110 

ATG CGA AAA GTT 3 85 

Met Arg Lys Val 
125 

ACT GCT GCG GGC 433 

Thr Ala Ala Gly 
140 

TTG GAC AGT GCC 481 

Leu Asp Ser Ala 
155 

GCG TCG GCA CTT 529 
Ala Ser Ala Leu 
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ATG GTA CTC ATT CAG TCG ACG TCT GAG GCT GCG AGG TAT AAA TTT ATT 577 

Met Val Leu lie Gin Ser Thr Ser Glu Ala Ala Arg Tyr Lys Phe lie 
175 180 185 190 

GAG CAA CAA ATT GGG AAG CGC GTT GAC AAA ACC TTC CTA CCA AGT TTA 625 

Glu Gin Gin lie Gly Lys Arg Val Asp Lys Thr Phe Leu Pro Ser Leu 
195 200 205 

GCA ATT ATA AGT TTG GAA AAT AGT TGG TCT GCT CTC TCC AAG CAA ATT 673 

Ala He He Ser Leu Glu Asn Ser Trp Ser Ala Leu Ser Lys Gin lie 
210 215 220 

CAG ATA GCG AGT ACT AAT AAT GGA CAG TTT GAA ACT CCT GTT GTG CTT 721 

Gin He Ala Ser Thr Asn Asn Gly Gin Phe Glu Thr Pro Val Val Leu 
225 230 235 

ATA AAT GCT CAA AAC CAA CGA GTC ATG ATA ACC AAT GTT GAT GCT GGA 769 

lie Asn Ala Gin Asn Gin Arg Val Met He Thr Asn Val Asp Ala Gly 
240 245 250 

GTT GTA ACC TCC AAC ATC GCG TTG CTG CTG AAT CGA AAC AAT ATG GCA 817 

Val Val Thr Ser Asn He Ala Leu Leu Leu Asn Arg Asn Asn Met Ala 
255 260 265 270 

GCC ATG GAT GAC GAT GTT CCT ATG ACA CAG AGC TTT GGA TGT GGA AGT 865 

Ala Met Asp Asp Asp Val Pro Met Thr Gin Ser Phe Gly Cys Gly Ser 
275 280 285 

TAT GCT ATT TAGTAACTCG AG 886 



-8- 
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Tyr Ala lie 

290 

(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 289 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:4: 

Met lie Arg Phe Leu Val Leu Ser Leu Leu lie Leu Thr Leu Phe Leu 
15 10 15 

Thr Thr Pro Ala Val Glu Gly Asp Val Ser Phe Arg Leu Ser Gly Ala 
20 25 30 

Thr Ser Ser Ser Tyr Gly Val Phe lie Ser Asn Leu Arg Lys Ala Leu 
35 40 45 

Pro Asn Glu Arg Lys Leu Tyr Asp lie Pro Leu Leu Arg Ser Ser Leu 
50 55 60 

Pro Gly Ser Gin Arg Tyr Ala Leu lie His Leu Thr Asn Tyr Ala Asp 
65 70 75 80 

Glu Thr lie Ser Val Ala lie Asp Val Thr Asn Val Tyr lie Met Gly 
85 90 95 

Tyr Arg Ala Gly Asp Thr Ser Tyr Phe Phe Asn Glu Ala Ser Ala Thr 
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100 105 110 

Glu Ala Ala Lys Tyr Val Phe Lys Asp Ala Met Arg Lys Val Thr Leu 
115 120 125 

Pro Tyr Ser Gly Asn Tyr Glu Arg Leu Gin Thr Ala Ala Gly Lys lie 
130 135 140 

Arg Glu Asn He Pro Leu Gly Leu Pro Ala Leu Asp Ser Ala He Thr 
145 150 155 160 

Thr Leu Phe Tyr Tyr Asn Ala Asn Ser Ala Ala Ser Ala Leu Met Val 
165 170 175 

Leu He Gin Ser Thr Ser Glu Ala Ala Arg Tyr Lys Phe He Glu Gin 
180 185 190 

Gin He Gly Lys Arg Val Asp Lys Thr Phe Leu Pro Ser Leu Ala He 
195 200 205 

He Ser Leu Glu Asn Ser Trp Ser Ala Leu Ser Lys Gin He Gin He 
210 215 220 

Ala Ser Thr Asn Asn Gly Gin Phe Glu Thr Pro Val Val Leu He Asn 
225 230 235 240 

Ala Gin Asn Gin Arg Val Met He Thr Asn Val Asp Ala Gly Val Val 
245 250 255 

Thr Ser Asn He Ala Leu Leu Leu Asn Arg Asn Asn Met Ala Ala Met 
260 265 270 

Asp Asp Asp Val Pro Met Thr Gin Ser Phe Gly Cys Gly Ser Tyr Ala 
275 280 285 



He 




(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1450 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI- SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Oryza sativa 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: alpha-amylase 

(ix) FEATURE: 

(A) NAME/KEY: CDS (B) LOCATION: 12. .1316 

(B) LOCATION: 12. .1316 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

CCTCGAGGTG C ATG CAG GTG CTG AAC ACC ATG GTG AAC A CAC TTC TTG 48 

Met Gin Val Leu Asn Thr Met Val Asn Lys His Phe Leu 
15 10 

TCC CTT TCG GTC CTC ATC GTC CTC CTT GGC CTC TCC TCC AAC TTG ACA 96 

Ser Leu Ser Val Leu lie Val Leu Leu Gly Leu Ser Ser Asn Leu Thr 



PATENT 

-il- 



ls 20 25 

GCC GGG CAA GTC CTG TTT CAG GGA TTC AAC TGG GAG TCG TGG AAG GAG 144 

Ala Gly Gin Val Leu Phe Gin Gly Phe Asn Trp Glu Ser Trp Lys Glu 
30 35 40 45 

AAT GGC GGG TGG TAC AAC TTC CTG ATG GGC AAG GTG GAC GAC ATC GCC 192 

Asn Gly Gly Trp Tyr Asn Phe Leu Met Gly Lys Val Asp Asp lie Ala 
50 55 60 

GCA GCC GGC ATC ACC CAC GTC TGG CTC CCT CCG CCG TCT CAC TCT GTC 240 

Ala Ala Gly lie Thr His Val Trp Leu Pro Pro Pro Ser His Ser Val 
65 70 75 

GGC GAG CAA GGC TAC ATG CCT GGG CGG CTG TAC GAT CTG GAC GCG TCT 288 

Gly Glu Gin Gly Tyr Met Pro Gly Arg Leu Tyr Asp Leu Asp Ala Ser 
80 85 90 

AAG TAC GGC AAC GAG GCG CAG CTC AAG TCG CTG ATC GAG GCG TTC CAT 33 6 

Lys Tyr Gly Asn Glu Ala Gin Leu Lys Ser Leu lie Glu Ala Phe His 
95 100 105 

GGC AAG GGC GTC CAG GTG ATC GCC GAC ATC GTC ATC AAC CAC CGC ACG 384 

Gly Lys Gly Val Gin Val He Ala Asp He Val He Asn His Arg Thr 
110 115 120 125 

GCG GAG CAC AAG GAC GGC CGC GGC ATC TAC TGC CTC TTC GAG GGC GGG 432 

Ala Glu His Lys Asp Gly Arg Gly He Tyr Cys Leu Phe Glu Gly Gly 
130 135 140 



ACG CCC GAC TCC CGC CTC 

Thr Pro Asp Ser Arg Leu 
145 

GAC CCC TAC GGC CAT GGC 

Asp Pro Tyr Gly Asp Gly 
160 

GCC GCC GCG CCG GAC ATC 

Ala Ala Ala Pro Asp lie 
175 

CTC ATT GGC TGG CTC GAC 

Leu lie Gly Trp Leu Asp 
190 195 

TGG CGC CTC GAC TTC GCC 

Trp Arg Leu Asp Phe Ala 
210 

TAC ATC GAC GCC ACC GAG 

Tyr He Asp Ala Thr Glu 

225 

TCC ATG GCG AAC GGC GGG 

Ser Met Ala Asn Gly Gly 
240 
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GAC TGG GGC CCG CAC ATG 

Asp Trp Gly Pro His Met 
150 

ACC GGC AAC CCG GAC ACC 

Thr Gly Asn Pro Asp Thr 
165 

GAC CAC CTC AAC AAG CGC 

Asp His Leu Asn Lys Arg 
180 185 

TGG CTC AAG ATG GAC ATC 

Trp Leu Lys Met Asp He 
200 

AAG GGC TAC TCC GCC GAC 

Lys Gly Tyr Ser Ala Asp 
215 

CCG AGC TTC GCC GTG CCC 

Pro Ser Phe Ala Val Ala 
230 

GAC GGC AAG CCG AAC TAC 

Asp Gly Lys Pro Asn Tyr 
245 
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ATC TGC CGC GAC 480 

He Cys Arg Asp 
155 

GGC GCC GAC TTC 528 

Gly Ala Asp Phe 
170 

GTC CAG CGG GAG 576 
Val Gin Arg Glu 

GGC TTC GAC GCG 624 

Gly Phe Asp Ala 

205 

ATG GCA AAC ATC 672 

Met Ala Lys lie 
220 

GAG ATA TCG ACG 720 

Glu He Trp Thr 
235 

GAC CAG AAC GCG 768 

Asp Gin Asn Ala 

250 




CAC CGG CAG GAG CTG GTC 

His Arg Gin Glu Leu Val 
255 

ACC AAC GGC ACG GCG TTC 

Ser Asn Gly Thr Ala Phe 
270 275 

GCC GTG GAG GGC GAG CTG 

Ala Val Glu Gly Glu Leu 
290 

CCC GGC ATG ATC GGG TGC 

Pro Gly Met lie Gly Trp 
305 

AAC CAC GAC ACC GGC TCG 

Asn His Asp Thr Gly Ser 
320 

AAG GTC ATG CAG GGC TAC 

Lys Val Met Gin Gly Tyr 
335 

TGC ATC TTG TAC GAC CAT 

Cys lie Phe Tyr Asp His 
350 355 

GAG CGC CTG GTG TCA ATC 
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AAC TGG GTC GAT CGT GTC 

Asn Trp Val Asp Arg Val 
260 265 

GAC TTC ACC ACC AAG GGC 

Asp Phe Thr Thr Lys Gly 
280 

TGG CGC CTC CGC GGC GAG 

Trp Arg Leu Arg Gly Glu 
295 

TGG CCG GCC AAG GCG ACG 

Trp Pro Ala Lys Ala Thr 
310 

ACG CAG CAC CTG TGG CCG 

Thr Gin His Leu Trp Pro 
325 

GCA TAC ATC CTC ACC CAC 

Ala Tyr lie Leu Thr His 
340 345 

TTC TTC GAT TGG GGT CTC 

Phe Phe Asp Trp Gly Leu 
360 

AGA AAC CGG CAG GGG ATC 
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GGC GGC GCC AAC 816 
Gly Gly Ala Asn 

ATC CTC AAC GTC 864 

lie Leu Asn Val 
285 

GAC GGC AAG GCG 912 

Asp Gly Lys Ala 
300 

ACC TTC GTC GAC 960 

Thr Phe Val Asp 
315 

TTC CCC TCC GAC 1008 

Phe Pro Ser Asp 

330 

CCC GGC AAC CCA 1056 
Pro Gly Asn Pro 

AAG GAG GAG ATC 1104 

Lys Glu Glu lie 
365 

CAC CCG GCG AGC 1152 
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Glu Arg Leu Val Ser He Arg Asn Arg Gin Gly He His Pro Ala Ser 
370 375 380 

GAG CTG CGC ATC ATG GAA GCT GAC AGC GAT CTC TAC CTC GCG GAG ATC 1200 

Glu Leu Arg He Met Glu Ala Asp Ser Asp Leu Tyr Leu Ala Glu lie 
385 390 395 

GAT GGC AAG GTG ATC ACA AAG ATT GGA CCA AGA TAC GAC GTC GAA CAC 124 8 

Asp Gly Lys Val He Thr Lys He Gly Pro Arg Tyr Asp Val Glu His 
400 405 410 

CTC ATC CCC GAA GGC TTC CAG GTC GTC GCG CAC GGT GAT GGC TAC GCA 1296 

Leu He Pro Glu Gly Phe Gin Val Val Ala His Gly Asp Gly Tyr Ala 
415 420 425 

ATC TGG GAG AAA ATC TGAGCGCACG ATGACGAGAC TCTCAGTTTA GCAGATTTAA 1351 

He Trp Glu Lys Lie 

430 435 

CCTGCGATTT TTACCCTGAC CGGTATACGT ATATACGTGC CGGCAACGAG CTGTATCCGA 1411 



TCCGAATTAC GGATGCAATT GTC CACGAAG TCCTCGAGG 1450 



(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 434 amino acids 

(B) TYPE: amino acid 
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(D) Topology: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

Met Gin Val Leu Asn Thr Met Val Asn Lys His Phe Leu Ser Leu Ser 
15 10 15 

Val Leu lie Val Leu Leu Gly Leu Ser Ser Asn Leu Thr Ala Gly Gin 
20 25 30 

Val Leu Phe Gin Gly Phe Asn Trp Glu Ser Trp Lys Glu Asn Gly Gly 
35 40 45 

Trp Tyr Asn Phe Leu Met Gly Lys Val Asp Asp lie Ala Ala Ala Gly 
50 55 60 

lie Thr His Val Trp Leu Pro Pro Pro Ser His Ser Val Gly Glu Gin 
65 70 75 80 

Gly Tyr Met Pro Gly Arg Leu Tyr Asp Leu Asp Ala Ser Lys Tyr Gly 
85 90 95 

Asn Glu Ala Gin Leu Lys Ser Leu lie Glu Ala Phe His Gly Lys Gly 
100 105 110 

Val Gin Val lie Ala Asp lie Val lie Asn His Arg Thr Ala Glu His 
115 120 125 

Lys Asp Gly Arg Gly He Tyr Cys Leu Phe Glu Gly Gly Thr Pro Asp 
130 135 140 

Ser Arg Leu Asp Trp Gly Pro His Met He Cys Arg Asp Asp Pro Tyr 
145 150 155 160 



1 
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Gly Asp Gly Thr Gly Asn Pro Asp Thr Gly Ala Asp Phe Ala Ala Ala 
165 170 175 

Pro Asp He Asp His Leu Asn Lys Arg Val Gin Arg Glu Leu He Gly 
180 185 190 

Trp Leu Asp Trp Leu Lys Met Asp lie Gly Phe Asp Ala Trp Arg Leu 
195 200 205 

Asp Phe Ala Lys Gly Tyr Ser Ala Asp Met Ala Lys He Tyr He Asp 
210 215 220 

Ala Thr Glu Pro Ser Phe Ala Val Ala Glu He Trp Thr Ser Met Ala 
225 230 235 240 

Asn Gly Gly Asp Gly Lys Pro Asn Tyr Asp Gin Asn Ala His Arg Gin 
245 250 255 

Glu Leu Val Asn Trp Val Asp Arg Val Gly Gly Ala Asn Ser Asn Gly 
260 265 270 

Thr Ala Phe Asp Phe Thr Thr Lys Gly He Leu Asn Val Ala Val Glu 
275 280 285 

Gly Glu Leu Trp Arg Leu Arg Gly Glu Asp Gly Lys Ala Pro Gly Met 
290 295 300 

He Gly Trp Trp Pro Ala Lys Ala Thr Thr Phe Val Asp Asn His Asp 
305 310 315 320 

Thr Gly Ser Thr Gin His Leu Trp Pro Phe Pro Ser Asp Lys Val Met 
325 330 335 



Gin Gly Tyr Ala Tyr He Leu Thr His Pro Gly Asn Pro Cys He Phe 
340 345 350 
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Tyr Asp His Phe Phe Asp Trp Gly Leu Lys Glu Glu He Glu Arg Leu 
355 360 365 

Val Ser He Arg Asn Arg Gin Gly He His Pro Ala Ser Glu Leu Arg 
370 375 380 

He Met Glu Ala Asp Ser Asp Leu Tyr Leu Ala Glu He Asp Gly Lys 
385 390 395 400 

Val He Thr Lys He Gly Pro Arg Tyr Asp Val Glu His Leu He Pro 
405 410 415 

Glu Gly Phe Gin Val Val Ala His Gly Asp Gly Tyr Ala He Trp Glu 
420 425 430 

Lys He 



(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 709 base pairs 

(B) TYPE: nucleic acid 
(G) STRANDEDNESS : single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA to ItlRNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTI- SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 



(vii) 



IMMEDIATE SOURCE: 
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(B) CLONE: alpha -hemoglobin 

(ix) FEATURE : 

(A) NAME/KEY: trans it_peptide (B) LOCATION : 26. .241 

(B) LOCATION: 26. .241 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 245. .670 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 7: 

CTCGAGGGCA TCTGATCTTT CAAGAATGGC ACAAATTAAC AACATGGCAC AAGGGATACA 60 

AACCCTTAAT CCCAATTCCA ATTTCCATAA ACCCCAAGTT CCTAAATCTT CAAGTTTTCT 120 

TGTTTTTGGA TGTAAAAAAC TGAAAATTC AGCAAATTCT ATGTTGGTTT TGAAAAAAGA 180 

TTCAATTTTT ATGCAAAAGT TTTGTTCCTT TAGGATTTCA GCAGGTGGTA GAGTTTCTTG 24 0 

CATG GTG CTG TCT CCT GCC GAC AAG ACC AAC GTC AAG GCC GCC TGG GGC 289 

Val Leu Ser Pro Ala Asp Lys Thr Asn Val Lys Ala Ala Trp Cly 
15 10 15 

AAG GTT GGC GCG CAC GCT GGC GAG TAT GGT GCG GAG GCC CTG GAG AGG 337 

Lys Val Gly Ala His Ala Gly Glu Tyr Gly Ala Glu Ala Leu Glu Arg 
20 25 30 

ATG TTC CTG TCC TTC CCC ACC ACC AAG ACC TAC TTC CCG CAC TTC GAC 385 

Met Phe Leu Ser Phe Pro Thr Thr Lys Thr Tyr Phe Pro His Phe Asp 
35 40 45 

CTG AGC CAC GGC TCT GCC CAG GTT AAG GGC CAC GGC AAG AAG GTG GCC 433 
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Leu Ser His Gly Ser Ala Gin Val Lys Gly His Gly Lys Lys Val Ala 
50 55 60 

GAC GCG CTG ACC AAC GCC GTG GCG CAC GTG GAC GAC ATG CCC AAC GCG 481 

Asp Ala Leu Thr Asn Ala Val Ala His Val Asp Asp Met Pro Asn Ala 
65 70 75 

CTG TCC GCC CTG AGC GAC CTG CAC GCG CAC AAG CTT CGG GTG GAC CCG 529 

Leu Ser Ala Leu Ser Asp Leu His Ala His Lys Leu Arg Val Asp Pro 
80 85 90 95 

GTC AAC TTC AAG CTC CTA AGC CAC TGC CTG CTG GTG ACC CTG GCC GCC 577 

Val Asn Phe Lys Leu Leu Ser His Cys Leu Leu Val Thr Leu Ala Ala 
100 105 110 

CAC CTC CCC GCC GAG TTC ACC CCT GCG GTG CAC GCC TCC CTG GAC AAG 625 

His Leu Pro Ala Glu Phe Thr Pro Ala Val His Ala Ser Leu Asp Lys 
115 120 125 

TTC CTG GCT TCT GTG AGC ACC GTG CTG ACC TCC AAA TAC CGT TAAGCTGGAG 677 

Phe Leu Ala Ser Val Ser Thr Val Leu Thr Ser Lys Tyr Arg 
130 135 140 



CCTCGGTAGC CGTTCCTCCT GCCCGGTCGA CC 709 
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(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 141 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(ix) SEQUE1 

Val Leu Ser Pro 
1 

Val Gly Ala His 
20 

Phe Leu Ser Phe 
35 

Ser His Gly Ser 
50 

Ala Leu Thr Asn 
65 

Ser Ala Leu Ser 



Asn Phe Lys Leu 
100 

Leu Pro Ala Glu 
115 



ICE DESCRIPTION: 

Ala Asp Lys Thr 
5 

Ala Gly Glu Tyr 

Pro Thr Thr Lys 
40 

Ala Gin Val Lys 
55 

Ala Val Ala His 
70 

Asp Leu His Ala 
85 

Leu Ser His Cys 

Phe Thr Pro Ala 
120 



SEQ ID NO : 8 : 

Asn Val Lys Ala 
10 

Gly Ala Glu Ala 

25 

Thr Tyr Phe Pro 

Gly His Gly Lys 
60 

Val Asp Asp Met 
75 

His Lys Leu Arg 
90 

Leu Leu Val Thr 
105 

Val His Ala Ser 



Ala Trp Gly Lys 
15 

Leu Glu Arg Met 
30 

His Phe Asp Leu 
45 

Lys Val Ala Asp 

Pro Asn Ala Leu 
80 

Val Asp Pro Val 
95 

Leu Ala Ala His 
110 

Leu Asp Lys Phe 
125 
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Leu Ala Ser Val Ser Thr Val Leu Thr Ser Lys Tyr Arg 
130 135 .140 



(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 743 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA to mRNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTI- SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: beta-hemoglobin 

(ix) FEATURE: 

(A) NAME/KEY: t ran sit_pep tide (B) LOCATION: 26. .241 

(B) LOCATION: 26.. 241 

(ix) FEATURE : 

(A) NAME/KEY: CDS 

(B) LOCATION: 245 .. 685 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 



CTCGAGGGGA TCTGATCTTT CAAGAATGGC ACAAATTAAC AACATGGCAC AAGGGATACA 
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AAC C CTTAAT CCCAATTCCA ATTTCCATAA ACCCCAAGTT CCTAAATCTT CAAGTTTTCT 120 

TGTTTTTGGA TCTAAAAAAC TGAAAAATTC AGCAAATTCT ATGTTGGTTT TGAAAAAAGA 180 

TTCAATTTTT ATG CAAAAGT TTTGTTCCTT TAGGATTTCA GCAGGTGGTA GAGTTTCTTG 24 0 

GATG GTG CAC CTG ACT CCT GAG GAG AAG TCT GCC GTT ACT GCC CTG TGG 2 89 

Val His Leu Thr Pro Glu Glu Lys Ser Ala Val Thr Ala Leu Trp 
15 10 15 

GGC AAG GTG AAC GTG GAT GAA GTT GGT GGT GAG GCC CTG GGC AGG CTG 337 

Gly Lys Val Asn Val Asp Glu Val Gly Gly Glu Ala Leu Gly Arg Leu 
20 25 30 

CTG GTG GTC TAC CCT TGG ACC GAG AGG TTC TTT GAG TCC TTT GGG GAT 3 85 

Leu Val Val Tyr Pro Trp Thr Gin Arg Phe Phe Glu Ser Phe Gly Asp 
35 40 45 

CTG TCC ACT CCT GAT GCT GTT ATG GGC AAC CCT AAG GTG AAG GCT CAT 4 33 

Leu Ser Thr Pro Asp Ala Val Met Gly Asn Pro Lys Val Lys Ala His 
50 55 60 

GGC AAG AAA GTG CTG GGT GCC TTT AGT GAT GGC CTG GCT CAC CTG GAC 4 81 

Gly Lys Lys Val Leu Gly Ala Phe Ser Asp Gly Leu Ala His Leu Asp 
65 70 75 

AAC CTC AAG GGC ACC TTT GCC ACC CTG AGT GAG CTG CAC TGT GAC AAG 529 

Asn Leu Lys Gly Thr Phe Ala Thr Leu Ser Glu Leu His Cys Asp Lys 
80 85 90 95 
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CTG CAC GTG GAT CCT GAG AGC TTC AGG CTC CTA GGC AAC GTG CTG GTC 577 

Leu His Val Asp Pro Glu Ser Phe Arg Leu Leu Gly Asn Val Leu Val 
100 105 110 

TGT GTG CTG GCG CAT CAC TTT GGC AAA GAA TTC ACC CCA CCA GTG CAG 625 

Cys Val Leu Ala His His Phe Gly Lys Glu Phe Thr Pro Pro Val Gin 
115 120 125 

GCT GCC TAT CAG AAA GTG GTG GCT GGT GTG GCT AAT GCC CTG GCC CAC 673 

Ala Ala Tyr Gin Lys Val Val Ala Gly Val Ala Asn Ala Leu Ala His 
130 135 140 

AAG TAT CAC TAAGCTCGCT TTCTTGCTGT CCAATTTCTA TTAAAGGTTC 722 

Lys Tyr His 
145 

CTTTGTGGGG TCGAGGTCGA C 743 



(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 6 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 



Val His Leu Thr Pro Glu Glu Lys Ser Ala Val Thr Ala Leu Trp Gly 
15 10 15 
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Lys Val Asn Val Asp Glu Val Gly Gly Glu Ala Leu Gly Arg Leu Leu 
20 25 30 

Val Val Tyr Pro Trp Thr Gin Arg Phe Phe Glu Ser Phe Gly Asp Leu 
35 40 45 

Ser Thr Pro Asp Ala Val Met Gly Asn Pro Lys Val Lys Ala His Gly 
50 55 60 

Lys Lys Val Leu Gly Ala Phe Ser Asp Gly Leu Ala His Leu Asp Asn 
65 70 75 80 

Leu Lys Gly Thr Phe Ala Thr Leu Ser Glu Leu His Cys Asp Lys Leu 
85 90 95 

His Val Asp Pro Glu Ser Phe Arg Leu Leu Gly Asn Val Leu Val Cys 
100 105 110 

Val Leu Ala His His Phe Gly Lys Glu Phe Thr Pro Pro Val Gin Ala 
115 120 125 

Ala Tyr Gin Lys Val Val Ala Gly Val Ala Asn Ala Leu Ala His Lys 
130 135 140 

Tyr His 
145 



(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: peptide 

(v) FRAGMENT TYPE: N- terminal 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: alkalophilic Bacillus sp . 

(B) STRAIN: 38-2 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: beta- cyclodextrin 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

Ala Pro Asp Thr Ser Val Ser Asn Lys Gin Asn Phe Ser Thr Asp Val 
15 10 15 

He 

REMARKS 

The present Amendment is in response to Examiner's 
Notice to Comply With Requirements For Patent 
Applications Containing Nucleotide Sequence And/Or 
Amino Acid Sequence Disclosures mailed April 8, 1993. 

Applicants submit this Amendment to complete their 
disclosure of the sequences employed in the instant 
application pursuant to 37 C.F.R. § 1.821 et seq. 
Applicants also submit herewith a paper copy of the 
Sequence Listing and a floppy disk containing the 
Sequence Listing in computer readable form in 
compliance with 37 C.F.R. §§ 1.821 (c) and (e) . 

Examiner will note that no new matter has been 
introduced by the instant amendment to the 
Specification or by these submissions and that the 
content of the amendment to the Specification and the 
paper and computer readable copies of the Sequence 
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Listing are the same, 
amendment is proper. 



Dated: 



Therefore, entry of the instant 



Respectfully submitted, 



Albert P. Halluin 
Registration No. 25,227 



Attorneys for Applicant 



